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Abstract In this paper, the problem of non-rigid shape recognition is viewed from the 
perspective of metric geometry, and the applicability of diffusion distances within the 
Gromov-Hausdorff framework is explored. While the commonly used geodesic distance 
exploits the shortest path between points on the surface, the diffusion distance averages 
all paths connecting between the points. The diffusion distance provides an intrinsic 
distance measure which is robust, in particular to topological changes. Such changes 
may be a result of natural non-rigid deformations, as well as acquisition noise, in the 
form of holes or missing data, and representation noise due to inaccurate mesh con¬ 
struction. The presentation of the proposed framework is complemented with numerous 
examples demonstrating that in addition to the relatively low complexity involved in 
the computation of the diffusion distances between surface points, its recognition and 
matching performances favorably compare to the classical geodesic distances in the 
presence of topological changes between the non-rigid shapes. 

Keywords Non-rigid shape matching. Diffusion geometry, Gromov-Hausdorff 
distance. 


1 Introduction 

Non-rigid shapes are ubiquitous in the world we live in, from microscopic bacteria to 
tissues and parts of our body. Since pattern recognition applications need to deal with 
objects encountered in everyday life, non-rigid shape analysis has become important in 
many modern applications, such as object retrieval and recognition, surface matching, 
navigation, and target detection and recognition. One of the cornerstone problems in 
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the analysis of non-rigid shapes is the problem of shape similarity: given two objects, 
we need to tell how similar or dissimilar they are. This can be quantitatively expressed 
as a distance between two shapes. The main difficulty in such a comparison stems from 
the immense number of degrees of freedom present in the problem as a result of possible 
deformations that the non-rigid shapes can undergo. 

A common way the problem of non-rigid shape similarity has been approached in 
the pattern recognition literature is to try to find a representation of shapes which is 
invariant to a given class of deformations [37]. Using such a representation, it is then 
possible to compare shapes regardless of their deformations, when these deformations 
are limited to the given class. Riemannian geometry is of help in finding such invariant 
representations [15]. It is well-known, for example, that the intrinsic properties of a 
shape remain invariant under inelastic deformations, i.e., deformations that do not 
“stretch” or “tear” the object. Many recent papers, e.g., [16,17,20,26,29,35,36,38-40, 
42,45], exploit this fact in order to construct deformation-invariant shape distances 
and surface matching techniques. 

Elad and Kimmel [16] introduced a method for the recognition of 3D shapes based 
on Euclidean embedding, extending previous efforts by Schwartz et al. ]42] (see also 
[45]). The key idea of the method is to consider a shape as a metric space, whose metric 
structure is defined by the (natural) geodesic distances between pairs of points on the 
shape. ^ Geodesic distances, being an intrinsic property of the shape, are invariant 
to any inelastic deformation the shape can undergo (or, using metric geometry ter¬ 
minology, we can say that such deformations are isometric or metric-preserving when 
considering geodesic distances). Two non-rigid shapes are compared by first having 
their respective geodesic metric structures mapped into a low-dimensional Euclidean 
space using multidimensional scaling (MDS) [14], and then rigidly matching the result¬ 
ing images (called canonical forms). MDS allows to “undo” the non-rigid deformations 
of the shapes, leading to a bending-invariant shape comparison framework based on 
the pairwise geodesic distances. This method has been used in three-dimensional face 
recognition [4], analysis of articulated two-dimensional shapes and images [27,28], tex¬ 
ture mapping and object morphing ]7,19,49], and shape segmentation ]23]. 

Due to the fact that the canonical forms method uses an intermediate metric space 
to compare two shapes, an inaccuracy is introduced, as it is theoretically impossible 
to embed a generic metric structure into a finite-dimensional Euclidean space without 
distorting it. It was shown empirically in [8,46] that using spaces with non-Euclidean 
(non-flat) geometry makes it possible to obtain more accurate representations, but can 
not avoid the error completely. 

In [35], Memoli and Sapiro proposed a metric framework for non-rigid shape com¬ 
parison based on the Gromov-Hausdorff distance. This distance was introduced by 
Mikhail Gromov, [18], as a way to compute similarity between metric spaces. Using 
the Gromov-Hausdorff formalism, the comparison of two shapes can be posed as direct 
comparison of pairwise distances on the shapes (basically, in the discrete case, the com¬ 
parison up to permutations of the corresponding pairwise distance matrices or their 
corresponding submatrices). Since no fixed intermediate space is used, the representa¬ 
tion error inherent to canonical forms can be avoided. The Gromov-Hausdorff distance 
computation is an NP-hard problem, and together with a number of theoretical results, 
Memoli and Sapiro proposed a practical approximation scheme (with explicit proba- 

^ Recall that the geodesic distance between two points is the length of the shortest path, 
traveling on the surface of the shape, that connects the points. 
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bilistic bounds connecting the approximation to the actual Gromov-Hausdorff distance 
and the number of available sample points). 

According to an alternative but mathematically equivalent definition [11,35], the 
Gromov-Hausdorff distance computation can be posed as measuring the distortion of 
embedding one metric space into another. Bronstein et al. [5,6], observing the connec¬ 
tion between this formulation of the Gromov-Hausdorff distance and MDS, proposed an 
efficient computation of such embedding based on a continuous optimization problem 
(energy minimization), referred to as generalized MDS (GMDS). This method follows 
the line of thought of embedding into non-Euclidean spaces,^ and can be considered a 
natural extension thereof. This is the computational technique that will be exploited 
for the examples in this paper. 

All the aforementioned contributions considered only geodesic distances as the in¬ 
variant used to intrinsically compare non-rigid shapes. This is motivated by a number 
of fundamental reasons. First, many natural object deformations can be approximated 
as inelastic ones. Thus, methods based on geodesic distances allow good shape recog¬ 
nition accuracy. Second, there exists a plethora of efficient numerical methods for the 
computation of geodesic distances for diverse shape representations [3,24,33,34,44,48]. 

The notable drawback of the geodesic distances is their sensitivity to topological 
transformations. By modifying the connectivity of the shape, one can significantly alter 
the paths between points, and in particular the shortest one, which in turn, can result 
in significant changes of the geodesic distances. Inconsistent topology or “topological 
noise” are common phenomena in shapes acquired using 3D scanners or obtained as 
a result of point cloud triangulation [47]. Thus, in order to be able to deal with real- 
life data, it is important for shape similarity methods to be topology-invariant (or at 
least topologically robust). Of course, topological changes can result from natural non- 
rigid deformations as well, e.g., bending of an open hand until the finger tips touch. A 
practically useful shape recognition system should be able to somehow deal with these 
transformations. 

It should be noted, however, that the metric model of shapes allows to represent a 
shape as a generic metric space with any metric. Also, neither the canonical forms nor 
the Gromov-Hausdorff distance are necessarily limited to geodesic distances. Using an¬ 
other intrinsic distance, insensitive or robust to topological changes, instead of geodesic 
ones, could potentially make these methods cope with topological noise and topolog¬ 
ical deformations in general. In [9,10], Bronstein et al. showed that using Euclidean 
distances (which are robust to topology changes but not invariant or even robust to 
non-rigid deformations), and geodesic distances (which are invariant to non-rigid de¬ 
formations but not to topology changes) together in a framework resemblant both of 
the GMDS and iterative closest point (IGP) algorithms [2,12], allows to obtain a shape 
similarity method more robust to topological changes than the one obtained with the 
geodesic distances alone.^ 

Motivated by the generality of the metric framework, and the need to add topologi¬ 
cal insensitivity to non-rigid shape recognition, in this paper we propose to use diffusion 
geometry in the Gromov-Hausdorff framework. Diffusion distanees, introduced by La- 

^ Instead of embedding each shape into Euclidean, hyperbolic, or spherical spaces, as clas¬ 
sically done in MDS, the embedding is done from one shape into the other. 

^ Also, it was shown in [9,10] that the Gromov-Hausdorff type distance between shapes 
modeled as metric spaces with Euclidean geometry allows to obtain an alternative formulation, 
and then use GMDS as a computation method for the IGP method, a classical approach for 
rigid shape comparison (for more detailed analysis, see [32]). 
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fon et al. [13,25], are related to the probability of traveling on the surface from one 
point to another in a fixed number of random steps (random walk). There are a number 
of reasons that lead us to prefer these distances over minimal geodesics when we deal 
with topological noise and changes. First, the diffusion distance is an average length of 
paths connecting two points on the shape, while the geodesic distance is the length of 
the shortest path. This naturally makes diffusion distances less sensitive to topological 
changes [29,41]. Secondly, both diffusion and geodesic distance are intrinsic, thus in¬ 
variant to inelastic deformations. Finally, diffusion distance can be efficiently computed 
from the eigenvalues of a discrete approximation to the Laplace-Beltrami operator (or 
simply the eigenvalues of a weighted connectivity matrix). The reader is referred to 
[22,30,41] for some recent works on 3D shape recognition based on spectral methods, 
both for geodesic matrices and the Laplace-Beltrami operator, which is closely related 
to the diffusion distance [21]. The diffusion distance was also exploited for 3D point 
cloud recognition using the framework of distance distributions in [29]. Combining the 
Gromov-Hausdorff framework with diffusion distances leads to a non-rigid shape com¬ 
parison and matching approach which is robust to topological alterations such as holes 
and point-wise connectivity changes, as demonstrated in this paper. 

The remainder of this paper is organized as follows. In Section 2 we present the 
metric approach to shape recognition and the Gromov-Hausdorff framework. Section 3 
describes the basic diffusion distance theory. Section 4 is devoted to the numerical com¬ 
putation of the proposed shape distance based on the GMDS algorithm, and Section 5 
presents experimental results. Finally, Section 6 concludes the paper. 


2 Metric approach for shape recognition 

In this section we present some basic concepts that constitute the core of the metric 
approach for shape recognition, staying mostly at the intuitive level. For a rigorous 
and insightful treatment of the topic, the reader is referred to [11]. 


2.1 Basic notions in metric geometry 

We model a non-rigid shape as a metric space {X, dx), where X is a two-dimensional 
smooth compact connected and complete Riemannian surface (possibly with boundary) 
embedded into R^, and dx : X x X ^ R is a metric measuring distances between pairs 
of points on X. The key idea of the metric approach is to compare shapes as metric 
spaces. Two shapes {X,dx) and (Yjdy) are similar if the metrics between pairs of 
corresponding points on X and Y coincide, i.e., there exists a bijective map ip : X ^ Y 
such that dy o [ip x tp) = dx- Such a (p is called an isometry and X and Y in this case 
are said to be isometric. Isometry implies that in terms of intrinsic metric geometry, 
the two shapes are indistinguishable and thus are equivalent. 

The notion of isometry can be relaxed in order to define similarity of shapes. We 
will refer to a set C C X xY of pairs such that for every x £ X there exists at least one 
y € Y such that {x, y) £ C, and similarly for every y £ Y there exists an x £ X such 
that (x, y) £ C, as a correspondence between X and Y. Note that a correspondence C 
is not necessarily a function. We can define the distortion of the correspondence as the 
discrepancy between the corresponding metrics, 

dis(C') := sup \dx{x,x') - dy{y,y')\. 

{x,y),{x',y')£C 
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We say that the shapes X and Y are e-isometric if there exists a correspondence C 
with dis(C') < £. Such a C is called an e-isometry, e-isometry can be regarded as a 
criterion of shape similarity. For small values of e, the shapes are similar, and for large 
values of e, the shapes are dissimilar. 


2.2 Gromov-Hausdorff distance 

A very elegant framework to represent similarity of metric spaces as a distance was 
proposed by Gromov [11,18] and introduced into the non-rigid shape recognition area in 
[35]. For compact surfaces and our shape recognition framework, the Gromov-Hausdorff 
distance can be expressed in terms of the distortion obtained by embedding one surface 
into another, 

dGn{X,Y) := linfdis(C), 

where the infimum is taken over all correspondence C, and dis (C) is the distortion 
dehned above. 

The Gromov-Hausdorff distance is a metric on the quotient space of metric spaces 
under the isometry relation, and thus, in the context of the metric space model for shape 
recognition, is a good candidate for a shape distance [35]. Being a metric particularly 
implies that dGH(-^iy) = 0 if and only if X and Y are isometric. More generally, 
if dGH(-^i^) < then X and Y are 2e-isometric and conversely, if X and Y are 
e-isometric, then dGji{X,Y) < 2e [11]. 


2.3 Ghoice of a metric 

The metric approach we have described and the Gromov-Hausdorff distance do not 
prescribe any particular choice of the metric dx- In general, dx is independent of X 
and can be defined quite arbitrarily. There are, however, two natural choices of dx- 
The first choice is the geodesic metric, measuring the length of the shortest intrinsic 
path between a pair of points, constructing the intrinsic geometry of X. The second 
choice is the extrinsic Euclidean metric, measuring the length of a line in connecting 
two points on X that relates to the extrinsic geometry in which X is embedded, i.e., 
R^ (see also [32]). 

Extrinsic geometry is invariant to rigid transformations of the shape (rotation, 
translation, and reflection), which preserve Euclidean distances. However, nonrigid de¬ 
formations may change the extrinsic geometry (see example in Fig. 1). As a result, the 
Euclidean metric is not suitable for the comparison of shapes with significant bending 
or other type of non-rigid deformations. The intrinsic geometry on the other hand is 
invariant to inelastic shape deformations which do not “stretch” or “tear” the shape. 
As a particular case, it is also invariant to rigid transformations (see also [32]). There¬ 
fore, the geodesic metric is a good choice for comparing non-rigid shapes, as has been 
confirmed by numerous results as those mentioned in the introduction. 

Another important type of transformations a shape can undergo are those changing 
the shape topology. Omitting formal definitions, the topology of X can be thought of as 
a collection of neighborhoods of every point on X. This defines the connectivity of the 
shape - which points can be reached by a small step from a neighbor point. Topology 
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Extriiisically dissimilar 
Intrinsically similar 


Extrinsically similar 
Intrinsically dissimilar 


Extrinsically dissimilar 
Intrinsically dissimilar 


Fig. 1 Illustration of the difference between intrinsic and extrinsic similarity for comparison 
of non-rigid shapes with different topology. 


can change as a result of non-rigid deformations, such as a person bending, fingers 
touching, or a paper folding. Topological dissimilarities between shapes can also be the 
result of noise, typically arising in acquisition of shapes by a 3D scanner. Connectivity 
changes do not change the geometry (location of the points) of the shape, therefore, 
the Euclidean metric is not sensitive (or at least robust) to such transformations. 
Yet, geodesic distances can drastically change as a result of even “minor” connectivity 
changes, since such changes alter the shortest paths between two points on the shape. 

It is therefore important to use a metric which is both intrinsic (and thus invari¬ 
ant to non-rigid deformations) and also invariant (or at least robust) to topological 
changes. In this paper, we propose to use a different metric that while being intrinsic 
and invariant to bends, is also robust to topological changes, thereby simultaneously 
enjoying important properties of both extrinsic and intrinsic geometry. This new dis¬ 
tance is explained next. 


3 Diffusion geometry 

In [13,25] (see also [1] for a related effort), Lafon et al. introduced diffusion maps and 
diffusion distances as a method for data parametrization, embedding, and dimension¬ 
ality reduction. Informally, the diffusion distance between two points on a shape is 
related to the probability of arriving from one point to another in a random walk with 
a fixed number of steps. Since the diffusion distance operates on paths defined on the 
shape, it is an intrinsic property, and as a result, is bending-invariant. Moreover, being 
an average over all possible paths connecting two points, while the geodesic distance 
is the length of just the minimal one, the diffusion distance can be more robust than 
the geodesic distance in cases where topological changes are present. As an illustra- 
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tion, imagine the hand shape and two points on the tips of the index finger and the 
thumb. If the two fingertips are not touching, then both geodesic and diffusion dis¬ 
tances between the two points are large, as all paths connecting the two points travel 
throughout the whole hand. Yet, if the hand is bent in such a way that the fingertips 
touch each other, the minimal geodesic will “re-route” itself through the “shortcut” 
across the fingertips instead of going though the hand, leading to a significant change 
in the geodesic distance. For the diffusion distance, this new path added as a result of 
the topology change is averaged with the other paths, which reduces the effect of such 
a change. Obviously, the lesser sensitivity to topological changes attributed to the aver¬ 
aging property of the diffusion distance, comes at the expense of a potential reduction 
in discriminative power, as it usually happens in a trade off between invariance and 
discriminativity. However, we have not observed any significant loss of discriminativity 
in the experiments in this paper, nor in [29] (see the discussion in Section 6). 

Besides the above properties, the diffusion distance is a metric, and thus, a valid 
candidate for the definition of a metric space used in the Gromov-Hausdorff shape 
model. This stems from the fact that following the diffusion geometry framework, the 
3D shape can be embedded into a (theoretically infinite-dimensional) Euclidean space 
by means of an eigenmap referred to as diffusion map in [13,25[. The Euclidean distance 
in this space equals the diffusion distance measured on the original shape (see next). 

Formally, in order to compute the diffusion distance, we first construct a non¬ 
negative symmetric affinity function k{x, y) > 0 over all pairs of points x, y on the 
shape. In the discrete setting, the shape may be represented as a triangular mesh, 
point cloud, or parametric surface. For the exposition simplicity, we regard the shape 
Y as a finite set of points. The affinity function in this case can be represented as an 
N X N matrix K, where N is the number of samples.^ The matrix is symmetric and 
has non-zero values. 

Next, we define 


p{x,y) := 


k{x,y) 

v{x) 


( 1 ) 


where v{x) := '^y k{x,y). Note that while still positive, p{x,y) is not symmetric any 
longer, therefore, we define a symmetric version, 


p{x,y) :=p{x,y) 


\v{y)) 


( 2 ) 


It is easy to verify that 

'^p{x,y) = l. (3) 

X 

This allows to interpret the N x N matrix P = (p{x,y)) as a transition matrix of 
a random walk (Markov process) on X, where p{x,y) is the probability of transition 
from point a; to y in one step. Using the random walk formulation, the probability of 
transition from x to y in m steps is given by the m-th power of the matrix P. The 
element •) of the matrix P™ can be thought of as a “bump” centered at x and 

of width proportional to m. 

Then, the diffusion distance, defined as 

dL(x,y) := (4) 


In the continuous case, matrices are replaced by operators and vectors by functions. See 
more rigorous definitions in [13,25]. 



can be thought of as a distance between two bumps. It is inversely related to the 
connectivity of points x and y by paths of length m (i.e., if there are many such paths 
connecting x and y, the distance d2m{x,y) is small). 

Using eigendecomposition of P, we can expand p{x, y) as 

N 

P{x,y) = ^\l(l)i{x)(j>i{y), (5) 

i=0 

where Aq = 1 > > Af > ... > A^ are the eigenvalues (note that the eigenvalues are 

squared for convenience), of the matrix P, and (pi are the corresponding eigenvectors. 
Therefore, for the elements of the matrix P™ we obtain 

N 

= ^\i'^(j)i{x)(l)^{y). (6) 


The eigenmap 


<pTnix) := 


XTM^) 


V ; ) 


(7) 


dehned by the eigenvalues and eigenvectors of P, and mapping points from X to the 
Euclidean space, was termed dijfusion map in [13]. It is well known and easy to see 
that the Euclidean distance in the diffusion map space equals to the diffusion distance, 
i.e., 

\\'^m{x)-$m{y)\\ 2 =dl^{x,y). (8) 

As a result, the diffusion distance is a metric and therefore can be used to define a 
valid metric space in our framework. 

For completeness, we should mention that in order to account for non-uniform 
sampling density of X, the affinity function k{x, y) is further normalized. 


k{x,y) = 


k{x,y) 

v{x)v{y)' 


(9) 


and the normalized affinity function k is used instead of k [25]. The choice of k{x,y) 
itself is, to a large extent, arbitrary (in particular when considering experimental re¬ 
sults, while more restricted selections are needed for some of the critical theoretical 
results connecting this to the Laplace-Beltrami operator to hold). Typically, the Gaus¬ 
sian kernel k{x, y) = exp(—jja; — t/jj^/cr^) is used, where a depends on the shape and its 
sampling, e.g., it can be the average of Euclidean distances between all pairs of points 
in the shape [29]. Other kernels can be used as well. For the experiments in this paper, 
we found sufficient to use the kernel k{xi,Xj) = 1 if Xi and Xj are connected and zero 
otherwise (leading to a very sparse matrix K). The time constant m = 50 was used 
and density was assumed to be constant. 
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4 Numerical computation 

In order to make the problem of intrinsic shape similarity (and matching) compnta- 
tionally tractable, our first step consists of the discretization of the surfaces X and Y 
and their corresponding metrics dx and dy- We first assume the shapes to be rep¬ 
resented as discrete approximations X and Y of the underlying continuous smooth 
surfaces X and Y, respectively. In our implementation, triangular meshes were used 
for this purpose, and points on X and Y were represented in barycentric coordinates. 
The geodesic metric was computed using the fast marching algorithm on triangulated 
surfaces [24], while the diffusion metric was computed as described in the previous 
section.^ We denote the discretized metrics on X and Y as dj^ and respectively, 
omitting the type of the metric whenever possible. 

Our next goal is to construct a discrete approximation of a correspondence between 
the shapes and evaluate its distortion. For this purpose, let us fix two sufficiently dense 
finite samplings P = {pi, and Q = {qi, ...,qn} of X and Y, respectively.® A 

discrete correspondence between the shapes is defined as C = (P x Q') U (Q x P'), 
where P' = ...,pJi} and Q' = {qy ...^q'^} are some (different) sets of samples on 

X and Y, respectively, parametrizing the class of all discrete correspondences. 

Given two sets P and P' on X, we can construct an m x n distance matrix D{P, P'), 
whose elements are the distances dj^{pi,p'j). In these terms, the distortion of the cor¬ 
respondence C can be written as 

D{P,P') \_( D{Q',Q') D{Q',Q)\ | 
[D{P,P'f D{P',P')) [D{Q',Qf D{Q,Q) J 

where jj • jj is some norm on the space of (m-l-n) x (m + n) matrices. The selection of the 
infinity norm |lII||oo = max^j \dij\ is consistent with the Gromov-Hausdorff distance, 
however, in practice more robust norms like the Frobenius norm ||II|lp = trace(IIZ)^) 
are often preferable (see [31,35] for discussions on the regularization of the infinity 
norm in the Gromov-Hausdorff framework by other Ip norms). 

The discretization of dis (C) leads directly to a discretized approximation of the 
Gromov-Hausdorff distance between shapes, which can be expressed as 

dGH(A,y) := i mindis(G). 

Note that only and Q' participate as continuous minimization variables, while P 
and Q are constants (given samples on the respective shapes). The above minimization 
problem is solved using GMDS - a numerical procedure resembling in its spirit standard 
multidimensional scaling. The reader is referred to [6] for further computational details. 

^ As described in the introduction, both the geodesic and diffusion distances can be efficiently 
computed for other surface representations as well, including point cloud data. Therefore, the 
framework here introduced is not limited to meshes. 

^ We use the farthest point sampling strategy to produce r-separated/coverings of the shape, 
where the parameter r controls the radius of the sampling (see [35] for details on the rela¬ 
tionships between the Gromov-Hausdorff distance between these discrete coverings and the 
underlying continuous spaces). Such farthest sampling strategy can also be easily applied both 
to meshes and point cloud data. 
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5 Experimental results 


In this section, we present experimental results showing the advantages of using the 
diffusion distance instead of the geodesic one for shape comparison under the metric 
model. 

In the hrst experiment, we tested the sensitivity of shape recognition to point-wise 
topology changes. We used a subset of the TOSCA shape database, [10], consisting of 
four different objects: cat, dog, male, and female. Each of the objects had a reference 
pose as well as a number of near-isometric deformations (2 for the cat, 4 for the dog, 
and 6 for the male and female). For each of the deformations, a version with different 
topology was created by welding some of the vertices of the shape (marked by red 
circles in Figure 2). 

Figure 3 shows the ROC curve (false acceptance rate vs false rejection rate), repre¬ 
senting the accuracy of shape recognition with both geodesic and diffusion distances. 
The equal error rate (EER) is 4% for diffusion distances and 13.5% for geodesic dis¬ 
tances, respectively. Figures 4 and 5 visualize the shape similarity with both distances. 
Note how the diffusion distance leads to a much clear clustering of the four classes. 

Figure 6 shows the correspondence between different deformations of the cat object, 
obtained as a byproduct of GMDS with geodesic and diffusion distances, respectively. 
To quantify the correspondence quality, we measured the Li distance between the 
computed correspondence C = (P x Q')U(Q x P') and the groundtruth correspondence 
Co = (Po X Qo) U {Qo X Pq), defined as 


d{C, Co) 


1 f \\D{P',Plo)\\i HP(Q',Q;,)||i \ 

2 \ • diam(A') • diam(y) J 


where the distance matrices D and the diameters are understood in the sense of the 
geodesic or the diffusion metrics. The achieved correspondence quality is summarized 
in the following table: 


Geodesic Diffusion 

Without topology changes 0.0514 0.0268 

With topology changes 0.1586 0.0291 


The correspondence appears to be accurate and satisfactory visually for both distances 
when no topological changes are present. However, as the result of topological changes, 
the correspondence obtained using geodesic distances deteriorated approximately three 
times, whereas the the correspondence computed using diffusion distances deteriorates 
by less than 10%. 

In the second set of experiments, we tested the sensitivity of shape recognition 
to holes. Such topological changes are often encountered as acquisition imperfections 
when the shapes are acquired using a 3D scanner. We used a subset of the TOSCA 
shape database, consisting again of four different objects: cat, dog, male, and female. 
Each of the objects had a reference pose and two near-isometric deformations. Each 
deformation, in turn, had two versions: with 10 and 20 holes (see Figure 7). 

Figures 8 and 9 visualize the shape similarity with both methods. Note for example 
how the men and women are much better separated with the diffusion distance. The 
EER in this experiment is 0% for diffusion distances and 6.8% for geodesic distances. 
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Fig. 2 The shapes used in the first set of experiments. Red circles indicate point-wise topo¬ 
logical changes. 


6 Conclusions 

In this paper, we addressed the problem of shape recognition in the presence of topo¬ 
logical changes. We used the metric approach, modeling shapes as metric spaces and 
posing the problem of shape similarity as the similarity between metric spaces. We 
showed that the Gromov-Hausdorff distance, previously applied to geodesic metrics for 
bending-invariant shape recognition, can be applied to shapes endowed with diffusion 
geometry, leading to a topologically robust approach for non-rigid 3D shape comparison 
and matching. In particular, we showed how replacing the geodesic distance between 
pair of surface points by the diffusion distance, leads to recognition improvements for 
data with topological variations such as holes and connectivity changes. This robust¬ 
ness to holes is a first step toward the recognition of partial shapes, since the missing 
portion can be considered as a “hole.” 
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Fig. 3 ROC curves for the first experiment. The EER is 4% for diffusion distances and 13.5% 
for geodesic distances. 


In addition to the practical consequences brought by the proposed framework, this 
work opens the door to moving beyond the classical use of geodesics for intrinsic non- 
rigid shape matching. Thereby, the use of other intrinsic distances, as well as other 
kernels in the diffusion framework, deserves further study. The combination of such 
distances may lead to further performance improvements as well. The combination of 
the Gromov-Hausdorff-Diffusion framework with topological features as those described 
in [43] is of great practical interest as well. 

At a theoretical level, many existing questions are emphasized and new ones are 
posed. The geometry of the shape space defined by the Gromov-Hausdorff metric with 
diffusion distances is of great theoretical and practical significance. In addition, the 
study of the classes of shape transformations under which the diffusion geometry is 
invariant (“diffusion isometries”) and their relation to “geodesic isometries” should 
give an important insight on the cases in which each of the methods is preferable. 
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Fig. 4 Shape recognition results using GMDS with geodesic distances. The plot was obtained 
by applying multidimensional scaling to the matrix of pairwise Gromov-Hausdorff-Geodesic 
distances between the shapes. Hollow circles indicate shapes with point-wise topological noise. 
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Fig. 5 Shape recognition results using GMDS with diffusion distances. The plot was obtained 
by applying multidimensional scaling to the matrix of pairwise Gromov-Hausdorff-Diffusion 
distances between the shapes. Hollow circles indicate shapes with point-wise topological noise. 



Geodesic 


Diffusion 


Fig. 6 Gorrespondence between the reference cat shape (left) and its isometric deformation 
found using GMDS with geodesic (middle) and diffusion (right) distances. First row: same 
topology; second row: different topology obtained by welding the vertices at points marked 
by red circles. The correspondence is visualized by showing Voronoi diagrams between corre¬ 
sponding points. 
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